# High-precision Recognition

Roberta Base Ai Text Detection V1
Apache-2.0
A fine-tuned model based on RoBERTa-base for detecting AI-generated English text.
Text Classification Transformers English
R
fakespot-ai
574
1
Bert Large Uncased Merged
Apache-2.0
This is a dataset for phishing attack detection, primarily used to train BERT models to identify phishing websites.
Text Classification Transformers English
B
buDujS
92
1
Nicpras Finetuned Yolo
This is a fine-tuned object detection model based on the YOLOv3 architecture, optimized for specific scenario recognition tasks
Object Detection Transformers
N
LykaAustria
24
0
Plant Identification Vit
Apache-2.0
A plant identification model fine-tuned based on Google Vision Transformer (ViT) architecture, achieving 80.96% accuracy on the evaluation set
Image Classification Transformers
P
marwaALzaabi
37
1
Videomae Large Finetuned Deepfake Subset
A fine-tuned version based on MCG-NJU/videomae-large model on the deepfake detection challenge dataset, used for video deepfake detection.
Video Processing Transformers
V
shylhy
519
0
Yolov10s
YOLOv10 is a real-time object detection model that achieves efficient and overhead-free object detection by eliminating post-processing steps such as Non-Maximum Suppression (NMS).
Object Detection
Y
kadirnar
15
0
Detr Face Detection
Openrail
A face detection model based on the CreativeML-OpenRAIL-M license, supporting the English language, primarily used for object detection tasks.
Object Detection Transformers English
D
diffusionai
108
1
Trocr Base Plate Number
Apache-2.0
An example vision model for recognizing vehicle license plates, capable of extracting license plate numbers from images.
Text Recognition Transformers
T
ghanahmada
100
1
Xlm Roberta Base Language Detection ONNX
A multilingual detection model based on XLM-RoBERTa, capable of identifying the language category of text.
Text Classification Transformers
X
Oblix
16
1
Donut Cn Invoice
An AI model specialized in Chinese invoice recognition, capable of accurately extracting key information from invoices.
Large Language Model Transformers Chinese
D
ewfian
32
0
Convnextv2 Large DogBreed
Apache-2.0
This model is a fine-tuned version of facebook/convnextv2-large-22k-224 on a dog breed classification dataset, achieving an accuracy of 91.39% on the evaluation set.
Image Classification Transformers
C
Pavarissy
184
6
Fashion Images Gender Age Vit Large Patch16 224 In21k V3
Apache-2.0
This model is a vision Transformer model fine-tuned on a fashion image gender and age classification dataset based on Google's ViT-Large architecture, achieving 99.6% accuracy on the evaluation set.
Image Classification Transformers
F
touchtech
286
5
Plant Vit Model 1
Apache-2.0
A plant image classification model based on the ViT architecture, achieving 99.95% validation accuracy after fine-tuning on an unknown dataset
Image Classification Transformers
P
Carina124
89
1
Detr Resnet 101
End-to-end object detection model based on Transformer architecture with ResNet-101 feature extractor
Object Detection Transformers
D
Xenova
216
2
Leafcondition
A visual model for plant leaf condition classification, capable of accurately identifying and analyzing the health status of plant leaves.
Image Classification Transformers
L
OttoYu
16
0
My Awesome Food Model
Apache-2.0
Food classification model fine-tuned on the food101 dataset based on Google's ViT model
Image Classification Transformers
M
jinkasreedhar
16
0
My Food Model
Apache-2.0
Food image classification model based on Google Vision Transformer (ViT) architecture, fine-tuned on the Food101 dataset with an accuracy of 90.9%
Image Classification Transformers
M
iammartian0
18
0
My Awesome Food Model
Apache-2.0
Food image classification model based on ViT architecture, fine-tuned on the Food101 dataset with an accuracy of 89.7%
Image Classification Transformers
M
asd0936
38
0
Vit Base Highways 2
Apache-2.0
A fine-tuned Vision Transformer model based on google/vit-base-patch16-224-in21k, achieving 70% accuracy on an unknown dataset
Image Classification Transformers
V
ogimgio
14
0
My Bean VIT
Apache-2.0
This is an image classification model based on the Vision Transformer (ViT) architecture, specifically designed for legume recognition tasks.
Image Classification Transformers
M
woojinSong
28
1
Exper6 Mesum5
Apache-2.0
Image classification model fine-tuned on the herbier_mesuem5 dataset based on google/vit-base-patch16-224-in21k
Image Classification Transformers
E
sudo-s
28
0
Swin Finetuned Food101
Apache-2.0
An image classification model fine-tuned on the Food101 dataset based on the Swin Transformer architecture, achieving an accuracy of 92.14%
Image Classification Transformers
S
skylord
258
8
Lmv2 G Aadhaar 236doc 06 14
This model is a fine-tuned version based on microsoft/layoutlmv2-base-uncased, specializing in document information extraction tasks, excelling in extracting fields such as Aadhaar card numbers, date of birth, gender, and names.
Sequence Labeling Transformers
L
Sebabrata
52
0
Swin Finetuned Food101
Apache-2.0
A food image classification model fine-tuned based on the Swin Transformer architecture, achieving 92.1% accuracy on the Food101 dataset
Image Classification Transformers
S
aspis
19
5
Resnet 50 Base Beans Demo
An image classification model fine-tuned on the beans dataset based on the ResNet-50 architecture, achieving an accuracy of 90.23%
Image Classification Transformers
R
eugenecamus
24
0
Snacks Classifier
A lightweight image classification model based on Microsoft's Swin Transformer Tiny architecture, achieving 92.86% test accuracy after fine-tuning on a snack classification dataset
Image Classification Transformers
S
Matthijs
15
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase